Pruning the Search Space of the Wolof LFG Grammar Using a Probabilistic and a Constraint Grammar Parser
نویسنده
چکیده
This paper presents a method for greatly reducing parse times in LFG by integrating a Constraint Grammar (CG) parser into a probabilistic context-free grammar. The CG parser is used in the pre-processing phase to reduce morphological and lexical ambiguity. Similarly, the c-structure pruning mechanism of XLE is used in the parsing phase to discard low-probability c-structures, before f-annotations are solved. The experiment results show a considerable increase in parsing efficiency and robustness in the annotation of Wolof running text. The Wolof CG parser indicated an f-score of 90% for morphological disambiguation and a speedup of ca. 40%, while the c-structure pruning method increased the speed of the Wolof grammar by over 36%. On a small amount of data, CG disambiguation and c-structure pruning allowed for a speedup of 58%, however with a substantial drop in parse accuracy of 3.62.
منابع مشابه
Studying impressive parameters on the performance of Persian probabilistic context free grammar parser
In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...
متن کاملApplying an Lfg Parser in Coreference Resolution: Experiments and Analysis
In this paper, we explore how LFG analyses as produced by the XLE parser with the English ParGram grammar can be used in a probabilistic coreference resolution system. So far, such systems have mainly relied only on information from surface-based NLP tools, reaching reasonable levels of performance while requiring only small amounts of training data. We compare these surface-based approaches wi...
متن کاملValency Change and Complex Predicates in Wolof: an Lfg Account
This paper presents an LFG-based analysis of Wolof valency-changing suffixes found in applicative and causative constructions. The analysis addresses the particular issue of applicative-causative polysemy in this language. Similar to the work for Indonesian (Arka et al., 2009), I adopt an LFG-based predicate composition approach of complex predicate formation (Alsina, 1996; Butt, 1995), and ext...
متن کاملTowards data - intensive testing of abroad - coverage LFG grammar Jonas
This paper addresses the problem that manual checking of output representations becomes impracticable in extensive tests during grammar development or in data-intensive applications of the grammar, like grammar-based lexicon acquisition from corpora. A method of annotating the sentences to be parsed with target expressions is proposed, using the LFG formalism itself to specify the expressions, ...
متن کاملFeature Engineering in Persian Dependency Parser
Dependency parser is one of the most important fundamental tools in the natural language processing, which extracts structure of sentences and determines the relations between words based on the dependency grammar. The dependency parser is proper for free order languages, such as Persian. In this paper, data-driven dependency parser has been developed with the help of phrase-structure parser fo...
متن کامل